Comparative Analysis of Serine/Arginine-Rich Proteins across 27 Eukaryotes: Insights into Sub-Family Classification and Extent of Alternative Splicing
Abstract
Alternative splicing (AS) of pre-mRNA is a fundamental molecular process that generates diversity in the transcriptome and proteome of eukaryotic organisms. SR proteins, a family of splicing regulators with one or two RNA recognition motifs (RRMs) at the N-terminus and an arginine/serine-rich domain at the C-terminus, function in both constitutive and alternative splicing. We identified SR proteins in 27 eukaryotic species, which include plants, animals, fungi and "basal" eukaryotes that lie outside of these lineages. Using the RRMs as a phylogenetic marker, we classified 272 SR genes into robust sub-families. The SR gene family can be split into five major groupings, which can be further separated into 11 distinct sub-families. Most flowering plants have double or nearly double the number of SR genes found in vertebrates. The majority of plant SR genes are under purifying selection. Moreover, in all paralogous SR genes in Arabidopsis, rice, soybean and maize, one of the two paralogs is preferentially expressed throughout plant development. We also assessed the extent of AS in SR genes based on a splice graph approach (http://combi.cs.colostate.edu/as/gmap_SRgenes). AS of SR genes is a widespread phenomenon throughout multiple lineages, with alternative 3' or 5' splicing events being the most prominent type of event. However, plant-enriched sub-families have 57%-88% of their SR genes experiencing some type of AS compared to the 40%-54% seen in other sub-families. The SR gene family is pervasive throughout multiple eukaryotic lineages, conserved in sequence and domain organization, but differs in gene number across lineages with an abundance of SR genes in flowering plants. The higher number of alternatively spliced SR genes in plants emphasizes the importance of AS in generating splice variants in these organisms.
Similar resources
Genome-wide analysis of alternative splicing landscapes modulated during plant-virus interactions in Brachypodium distachyon.
In eukaryotes, alternative splicing (AS) promotes transcriptome and proteome diversity. The extent of genome-wide AS changes occurring during a plant-microbe interaction is largely unknown. Here, using high-throughput, paired-end RNA sequencing, we generated an isoform-level spliceome map of Brachypodium distachyon infected with Panicum mosaic virus and its satellite virus. Overall, we detected...
Plant serine/arginine-rich proteins and their role in pre-mRNA splicing.
Pre-messenger RNA (pre-mRNA) splicing, a process by which mature mRNAs are generated by excision of introns and ligation of exons, is an important step in the regulation of gene expression in all eukaryotes. Selection of alternative splice sites in a pre-mRNA generates multiple mRNAs from a single gene that encode structurally and functionally distinct proteins. Alternative splicing of pre-mRNA...
Alternative splicing of pre-mRNAs of Arabidopsis serine/arginine-rich proteins: regulation by hormones and stresses.
Precursor mRNAs with introns can undergo alternative splicing (AS) to produce structurally and functionally different proteins from the same gene. Here, we show that the pre-mRNAs of Arabidopsis genes that encode serine/arginine-rich (SR) proteins, a conserved family of splicing regulators in eukaryotes, are extensively alternatively spliced. Remarkably about 95 transcripts are produced from on...
Serine/arginine-rich splicing factors belong to a class of intrinsically disordered proteins
Serine/arginine-rich (SR) splicing factors play an important role in constitutive and alternative splicing as well as during several steps of RNA metabolism. Despite the wealth of functional information about SR proteins accumulated to-date, structural knowledge about the members of this family is very limited. To gain a better insight into structure-function relationships of SR proteins, we pe...
Implementing a rational and consistent nomenclature for serine/arginine-rich protein splicing factors (SR proteins) in plants.
Growing interest in alternative splicing in plants and the extensive sequencing of new plant genomes necessitate more precise definition and classification of genes coding for splicing factors. SR proteins are a family of RNA binding proteins, which function as essential factors for constitutive and alternative splicing. We propose a unified nomenclature for plant SR proteins, taking into accou...